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Abstract 



We investigate several geometric models of network which simultaneously have some nice global 

properties, that the small diameter property, the small- community phenom,enon, which is defined to 

OO . capture the common experience that (almost) every one in our society belongs to some meaningful small 

O^ ' communities by the authors (2011), and that under certain conditions on the parameters, the power 

law degree distribution, which significantly strengths the results given by van den Esker (2008), and 

Jordan (2010). The results above, together with our previous progress in Li and Peng (2011), build a 

mathematical foundation for the study of communities and the small-community phenomenon in various 

networks. 

r^ . In the proof of the power law degree distribution, we develop the method of alternating concentration 



analysts to build concentration inequality by alternatively and iteratively applying both the sub- and 
super-martingale inequalities, which seems powerful, and which may have more potential applications. 



1 Introduction 

\^ ' With the availability of massive datasets of many real world networks, we are able to observe and study the 

OO . underlying dynamic mechanisms and many interesting phenomena in large-scale networks in a quantitative 

C^ ' way. Some properties such as sparse, high-clustering, hierarchical structure, the power law degree distribution 

and small diameter appear in a wide range of networks, ranging from Internet graphs, collaboration graphs to 



r^ ' PPI (Protein-Protein Interaction) networks. Modeling these interesting properties and phenomena not only 

^-^ ■ provides us a good way to better understand how these networks evolve and why these global phenomena 

occur through local growing rules, but also gives us insights on the development of new technologies or even 
cancer drugs. 

A typical network always simultaneously exhibits several properties. For example, in a Web graph, the 
nodes are web-pages and directed edges are hyperlinks between the pages, the number of nodes with in- 
^ '_ degree k is proportional to k~^ for some constant /?, i.e., the in-degree sequence obeys the power law degree 

distribution ([AJB99> KK R+99| ). It has also been observed that the Web graphs have a small average 
distance ^AJBOQ. ,BKM+00j . In this paper, when it is not confused, "small" means that the quantity is a 
polylogarithmic function of the number of graph nodes. Furthermore, the most community- like subgraphs 
in the large Web graphs turn out to have size about 100, which seems to be a general property in many large 
real networks .LLDMOSJ ILLMlOj . The above mentioned three properties are by no means particular in the 
technological networks, and they are shared also by a wide range of social networks, such as the friendship 
network of LiveJournal. 

The first two properties, i.e., the power law degree distribution and the small diameter property, have 
been explored extensively in the past decades. However, to our knowledge, the third property that good 
communities in large-scale networks have small sizes is still widely open due to the reason that there were no 
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mathematical definition of good communities in network, which motivates us to mathematically study the 
common experiences, or observations, or small experiments of the small-community phenomenon in networks. 

The authors proposed a mathematical definition for communities based on the concept of conductance, 
defined the small- community phenomenon in networks, and conjectured that small communities are ubiqui- 
tous in various networks (referred to our earlier work |LP11) ) . Intuitively, a given network is said to have the 
small-community phenomenon if almost every node in the network is contained in some good community of 
small size (referred to Section [2] for the formal definition). We found theoretical evidence for our conjecture 
that some classical network models (e.g., Kleinberg's small world model jKleOO| and Ravasz-Barabasi Hierar- 
chical model [RB03 ) do have the small-community phenomenon, though models without this phenomenon 
exist. 

There are also other reasons for us to make such a conjecture. Firstly, we all have the common experience 
that everyone in the society belong to some small meaningful groups which may correspond to classmates, 
friends or relatives. Secondly, the existing empirical studies provide us evidence that large communities are 
rare in large networks and good communities are of small size. Besides the direct evidence given by (LLDMOSl 
ILLMlOl IGR09 ■ there are also some implicit evidence. For example, jLanOS) has shown that spectral graph 
partitioning fails to generate highly unbalanced cuts over many large scale social networks and |KB10] 
pointed out that this failure may be caused by the abundance of small dense communities. In summary, we 
have reasons to conjecture that the small communities are ubiquitous, at least in many large social networks, 
which arises a number of new problems in both theory and applications of the small-community phenomenon 
in networks. 

We are interested in evolving models that simultaneously have these "good" properties: the power law 
degree distribution, the small diameter and the small-community phenomenon, which are shared in typical 
Web graphs and large social networks. Models with one or two properties are easily constructed in some 
natural way. In particular, the power law degree distribution arises from the preferential attachment scheme; 
the small diameter originates from a broad class of graph processes |Bol01j : the small community may be 
caused by the notion of homophily that, similar or close individuals have great tendency to associate with 
each other, which is a common reason for two people establish a relationship with each other in our society. 

However, when trying to define a model that unify all the three properties, we usually come across 
conflicts that are hard to reconcile. Not strictly speaking, the first two properties usually result from some 
expander like graphs while the small community corresponds to highly structured graphs which seem anti- 
expander like graphs to some extent |LP11) . Still, Ravasz-Barabasi Hierarchical model |RB03| satisfies 
all these requirements as shown in |LPllj . However, the Ravasz-Barabasi Hierarchical model has a very 
unnatural growing rule, which can only capture very special networks. 

Another good candidate may be the Geometric Preferential Attachment (henceforth GPA) model intro- 
duced by Flaxman, Frieze and Verta |FFV07a] IFFVOTb] . whose motivation was to model networks with 
power law degree distribution and small expansion. This model is defined on a unit-area spherical surface 
S, on which distance can be naturally introduced. The authors of [FFVOTa] combined the rich-get-richer 
effect and the concept of homophily in a simple way that every new comer chooses neighbors only from those 
exiting vertices that are not far from them using the preferential attachment scheme, and proved that the 
power law distribution occurs under some conditions of the parameters in the model. In [LPllj . we have 
shown that good communities exist for every node in a model under these conditions. However, the found 
communities are of relatively large size and the diameter is also large. 

In the present paper, we will first study a base model that is a GPA model with additive fitness. We 
generalize the result of |JorlO| and show that under some appropriate conditions, the base model have both 
the power law degree distribution and the small- community phenomenon. However, in this situation the 
diameter of the model is large. To resolve this problem, we try to incorporate a simple growth rule into our 
base model that leads to small diameter and does not change too much the degree sequence. The rule we 
try to use is the uniform recursive tree, i.e., each time a new vertex chooses a neighbor uniformly at random 
from exiting vertices. It has been well known that such a simple process results in a graph of diameter 
and maximum degree of order 0(lnn), where n is the number of generated vertices |SM95) . We give two 
alternate ways to incorporate this rule. Though the resulted two models are similar, their structures are 



different. Tlie first one is a hybrid model, wliicli can be regarded as a composition of two independent parts: 
a local graph, wfiicfi has tlie power law degree distribution, and a global graph, which may connect vertices 
that are far away. The hybrid model as a whole has the small diameter and the small community structure. 
The second is a self-loop model, in which we treat the additive fitness in our base model as the number 
of self-loops attached with the new vertex. This gives a new interpretation for the use of fitness in the 
preferential attachment schemes. With some further operation, the self-loop model is shown to have all the 
three good properties. 

The methodology we use to show the power law degree distribution may be of independent interest. 
The proof technique is inspired by the work [JorlO] , who investigated the asymptotic behavior of the degree 
sequence of the base model (see Section^. In our proof of the concentration inequalities, there are subtle re- 
strictions on parameters for which deeper mathematics is needed. Rather than using the coupling techniques 
as that in |FFV07al IvdEOSj . we recursively utilize the submartingale and supermartingale concentration 
inequalities |CL06| to give a better bound at each step, which will result in a sharp bound of the desired 
quantity. 

Further related works Avin studied a random distance graph that incorporates both the Erdos-Renyi 
graph and the random geometric graph |Avi08| . This graph is shown to have several good properties, e.g., 
the small diameter and high clustering coefficient et al. Hybrid model composed of a power law graph 
and a grid-like local graph is studied by several groups of researchers, see [CL041 IKBlOl IFG09J . Clusters 
or communities based on the concept of conductance was studied in |KVV04j and |LLDM08l ILLMIO) , in 
which the spectral algorithms and other approximation algorithms were used to detect good clusters or 
communities. 

In Section [21 we will introduce the definition of the small- community phenomenon as well as our models, 
and then state the main results of the paper. In the next three sections, we show that the models have the 
desired properties. In Section [71 we discuss the effect of the choice of a parameter on the properties of our 
proposed models. Finally, we give a brief conclusion in Section [51 

2 basic definitions, the model and main results 

2.1 The small-community Phenomenon 

In a graph G — {V, E), the degree of a node v ^V \s denoted as AegQ{v). The volume of a subset of S* C T^ 
is defined to be the sum of degrees of vertices in it, namely, vol(S') = X^ijgS '^^Sg('*^)- 

Our definition of communities is inspired by the work of Leskovec et al. |LLDM08) . who used the con- 
ductance to measure the goodness of a community. We introduced the concept of (a, j3, 7)-community based 
on the conductance and the size of a set of nodes |LPllj . The conductance $(S') of S is the ratio between 
the number of edges coming out of S and the volume of it or its complement S, whichever is smaller, i.e.. 



min{vol(5),vol(5')} ' 

where e{S, T) denotes the set of edges with one endpoint in S and the other in T . 
Now we formulate the (a, /3, 7)-community as follows: 

Definition 1. Given a graph G — {V,E) with \V\ — n, a connected set S izV with \S\ — uj{1) is a strong 
{a, 13) -community if 

MS) < ^ . (1) 

Moreover, if \S\ — 0((lnn)'^), then we say that S is a strong [a, (3,^) -community. 

Note that in the above definition we require that the size of a community is not too small (i.e., \S\ = w(l)). 
This requirement helps us to avoid the trivial case in our definition (when |S'| is constant, it can always be 



treated as a proper community by choosing large a). In fact, a meaningful community in society always can 
not be too small because of lack of requisite variety or other group function |A1104| . 

To characterize the feature that almost every one in the network belongs to some small community, we 
give the following definition. 

Definition 2. A network (model) G is said to exhibit the small- community phenomenon, if almost everJQ 
node belongs to some (a, P,^)-com,m,unity, where a,/3,7 > are some global constants. 

2.2 The Geometric Model 

The base model we will use is a geometric preferential attachment model with additive fitness. Such a model 
has been studied in [vdEOSl iJorlO) (see also |FFV07al IFFV07bj V Assume that a self-loop counts as degree 
1. The model is defined on a unit-area spherical surface 5* (i.e., the radius of the sphere is j^). Let n be 
the number of vertices we are going to generate. Let ^ > be an arbitrary constant and m,r,6 = ^m be 
some parameters which may depend on n (Note that this is the essential difference from the cases studied 
in [JorlOj ). Intuitively speaking, m is the number of edges we are going to add in each step; r is the distance 
restriction on the two endpoints of an edge; S is the additive fitness. Let Bfi{v) denote the spherical cap of 
radius R around v in S, i.e., Bji{v) — {u E S : \\u — v\\ < R}, where || • || denotes the angular distance on S. 
Let An = a.ica.{Bj^{v)) be the area of the spherical cap of radius R, which is independent of v. 

The base model: We start the process from a graph Gi, which is composed of a uniformly generated 
(from S) node xi with 2m self-loops. At each time t -f 1 for f > 0, if Gt = {Vt, Et), we first generate a new 
node Xt+i uniformly at random from S and then connect it to some existing vertices or itself. Specifically, 
if there is no node in Br{xt+i), then we add 2m self-loops to Xt+i] if Br{xt+i) nVt ^ 9, then we choose 
independently m contacts (with replacement) from Br{xt+i) for the new comer such that for any i with 
1 < i < m, the probability that some vertex v G Br{xt+i) is chosen as the ith contact is defined by 

Remark: in |vdE08) (also in [FFVOTal IFFVOTb) ) . a self- loop parameter a > 2 was introduced to avoid a 
technical problem when proving the power law degree distribution. In their settings, a node v 6 Br{xt+i) is 
chosen as the contact with probability 

degt{v)+S 



T^^^{J2w€BAxt+i)nvM'^Stiw) + S),a{m + S/2)Art} 

where S > — m. The case of a = is left open in these papers. Jordan |JorlO| investigated the asymptotic 
behavior of the degree sequence in the case of a = 0. In his study, m,r,S > are constants that not depend 
on 71, which converges to infinity. However, in our situation, we need a strong concentration result such that 
the parameters may depend on n. We will give such a result when a = and J > 0, which strengths the 
results in |vdE08[ IJorlO) and partially answers the open question in |FFV07a[ IFFV07b| . 

We can show that when S — S,m > and r = rg = n^^ilnnY" , where co = co(^) is large and may 
depend on ^, the base model has the power law degree distribution and the small-community phenomenon 
but does not have the small diameter. To incorporate the missing property while not changing the other 
two properties too much, we introduce some operations that essentially generate a uniform recursive tree. 
We give two different operations such that the resulted two variants of the base model both have the three 
properties to some extent. 

1. The hybrid model: In this model, every edge has an attribute that indicates whether it is a local-edge 
or a long-edge, which indicates that the two endpoints of the edge are local- or long-contacts of each 
other. A local- (or long-) edge contributes to the local- (or long-) degree of both of its endpoints. We 



^almost every means 1 — On(l), where n is the number of vertices in G 



start from the G^ the same as Gi in the above and let the self-loops of xi be local-edges. At each step 
t + 1 for i > 1, to form G^+i from G^, a new vertex xt^i is chosen uniformly at random from S. First 
we choose for the new comer m local-contacts yi^l < i < m, independently at random as in the base 
model with degi(w) in Eq. ^ denoting the local-degree of v at time t. Then we choose for Xt+i one 
other long contact z uniformly from Xi,- ■ ■ ^Xf. 

This model can be seen as composed of two parts: a local power law graph and a global uniform 
recursive tree, which can be generated in two phases: firstly, we can generate the local power law 
graph following the rules used in the base geometric model and then generate a recursive tree as 
follows: sequentially for t > 1, Xt+i connects a long-contact which is chosen uniformly at random from 

Xi,--- ,Xt. 

The independence of the local part and the global part of the hybrid model conforms to our intuition 
that local contacts and long contacts are formed by different mechanisms. Previous studies on such 
a model usually has a global power law graph and a local grid-like graph (see eg. |CL04j V which is 
comparable with ours. 

2. The self-loop model: In this model, every new node is born with S flexible self-loops which may be 
eliminated in later steps. Now we generate xi uniformly at random from S and add 2m + S self-loops 
to it with S > 2 loops marked flexible. This is the start graph Gf. At each step t + 1 for i > 1, to 
form G^^i from Gf , a new vertex xt+i is chosen uniformly at random from S and S flexible self-loops 
are added to it. We first choose m contacts j/i, 1 < i < m independently at random as in the base 
model with deg((i') in Eq. ^ denoting the number of non-flexible edges incident to v at time t. Then 
we choose for xt+i one other contact z uniformly from the set of existing nodes containing flexible 
self- loop(s) (such a set cannot be empty because xt is a member of it) and delete one flexible self- loop 
from both xt+i and z. The newly added edge {xt+i, z) is marked flexible. Note that the edge-rewiring 
keeps the degree of vertices unchanged, which facilitates the analysis of its degree distribution. 

This model can be seen as composed of two parts: a flexible part and a non-flexible part, which can 
be generated in several phases: we first generate the non-flexible part following the growth rules of 
the base model. We then add 5 flexible self- loops to each vertex. Then sequentially for each t > 1, 
Xt+i connects a contact z which is chosen uniformly at random from xi,- ■ ■ ,Xt, containing flexible 
self-loop(s), a flexible self-loop of Xt+i and z is deleted and a new flexible edge (xf+i, z) is added. 

We give a plausible explanation of the self-loops emerging in this model. It is widely studied in 
social sciences that people in our society have not only evident relationships with others, but some 
implicitly one-sided "parasocial" interactions with the celebrities, virtual characters and so on, in which 
relationship only one part knows a great deal about the other, but the other does not IIW56 . Such 
a relationship can barely be reflected by the usually used friendship networks, which mainly coins 
the two-sided friendship. Our model incorporates the parasocial relationships as self-loops and the 
edge-rewiring may be roughly interpreted as that the long-distance relationship is established at the 
expense of its parasocial connections. 

2.3 Main Results 

Our main results arc that the two models have rather good properties. Assume that 6 — m^, where f > is 
some constant and tq = n^^/^(lnn)'^° for some large constant Cq which may depend on ^. 

For r > ro, it is obvious that the diameter of the base model is ri(l/r) = n{n^^'^{\nn)~'^°) (see Section^), 
which is large, while the short diameters of the uniform recursive trees imply the small diameter results in 
our two generalized models. 

Theorem 1. (Small Diameter Property) 

1. For any ?7i > l,r > 0, with high probability, the diameter of G^ is O(lnn). 

2. For m > Ki{(_)\nn and r > 0, with high probability, the diameter of G%^ is O(lnn), where i^i(^) is 
some constant depending on ^. 



By the geometric structure of the models, it is natural to think of that a group of vertices close to each 
other behaves like a good community. We will make this intuition rigorous by considering the R-neighborhood 
Cr(v) of a vertex u, which is the set of all vertices within distance at most R from v in Gn and show that 
for some appropriate r and R, Cji{v) is a good community for every v. We give the following result: 

Theorem 2. (Small-Community Phenomenon) Ifr ~ vq andm > K2{£,)lnn, where ^2(0 ^^ som,e constant 
depending on ^, both G^ and G^ have the small- community phenomenon, i.e., in each model, with high 
probability, for every node v € Vn, there exists some {a, (5, j)- community containing v, where a,/3,7 are 
some constants independent of n. 

A simple corollary of the above theorem is that the base model G„ also has the small-community phe- 
nomenon, which indicates that the community structure is mainly determined by the geometric structure of 
our model and that the effect of long edges is little for the reason that every new node can establish m ^ In n 
local edges while only 1 long edges. 

The power law degree distribution stems from the preferential attachment scheme used in our base model, 
for which we have: 

Theorem 3. (Degree Distribution of the Base Model) In the base model, if r > tq. m = 0(ln n) and 5 — mS, 
for any constant ^ > 0, there exist some constants Ck and ^, such that for all k — k{n) > m, 

77 77 

E[Mt)]=C,^ + Oij-^^) , (4) 

where dk{t) denotes the number of vertices with degree k in the base model Gt, Ck = Gfe(?n,^) tends to a 
limit Goo ("2, (5) which only depends on m,S as k ~^ oo, and /i is some constant depending on ^ and strictly 
less than 1. 

Theorem [3] has already significantly strengthened the results in both van den Esker |vdE08| and Jor- 
dan [JorlO] . The proof of this theorem requires the new technique of recursively bounding the concentration 
inequalities as we will build in Section [6l 

Based on Theorem |3l we are able to show that in our generalized models, the networks satisfy a nice 
power law degree distribution. 

Theorem 4. (Power Law Degree Distribution) For r > ro and m — 0(ln n), the expected degree sequences 
of the local graph of the hybrid model G^ and the whole graph of the self-loop model G^ both follow a 
power law distribution with exponent 3 + ^. More specifically, there exist some constants G^, G| and ^, such 
that for all k = k{n) > m, 

1. in the hybrid model, E[dk{ri)\ = C'^-p^ + 0( /„"a-)n ), where dk{t) denotes the number of vertices with 
local- degree k in G^; 

2. in the self-loop model, E[dk{n)] — CkTSTi + 0( (-„"2-,n ); where dk{t) denotes the number of vertices with 
total degree k in G^ . 

In the above statements, both C^ and G| tend to some limits that depend on m, 6 only as k -^ oo, and /x is 
some constant depending on ^ and strictly less than 1. 

From the above theorems, we know that when r = ro = n^^"(lnri)^°, the two generalized models 
simultaneously have all the three properties to some extent (as in the hybrid model, only the local part has 
the power law degree distribution). What about the cases when r is too large or too small? We give some 
evidence that at least one of the three properties disappears in such cases. In particular, when r is large, we 
have the following new phenomenon. 

Theorem 5. (Large Community and Small Expander) In the base model G„, let r = n~^''^''^'^ , where e > 
and m > Klnn, for some .sufficiently large constant K. 



1. If R = n^^/^^P, for any p> e, then \Cr{v)\ = 6(71^'') and ^{Cr{v)) = 0{j^), with high probability. 

2. With high probability, for all R = o{r), <^{Cr{v)) = fi(l). 

Theorem [5] indicates that when r = 71^^'^+"^, there exists some large community for every node, which 
may not befong to any small community for the reason that the most natural candidate, i.e., the small 
neighborhood is not a good community. We remark that Theorem [5] may imply a new phenomenon in 
networks. It would be interesting to find some real world networks, in which there is a large fraction of nodes 
each of which is contained in both a good but large community and a small expander. We also note that the 
two generalized models have the same phenomenon for such a large r. 

In the remaining sections of the paper, we are devoted to proving our main results. Theorems [U [2l [3l IH 
and[5l We will organize the paper as follows. In Section [3l we introduce some basic tools for our proof, and 
basic properties of our network models. In Sections |4] and [Sj we prove Theorems [1] and [21 respectively. In 
Section El we prove Theorems |3l and IH In Section |7l we prove Theorem |5l Finally in Section |8l we discuss 
some further issues following the results in this paper. 

3 Useful tools and basic facts 

Before proving the main results, we first give several basic facts which will be useful in our proofs of the 
main results. 

We will use the following form of the Chernoff bound (see eg. Theorem 1.1 in |DP09| ). 

Lemma 1. If Xi, ■ ■ ■ ,Xt are independently distributed in [0, 1] and X — X]i=i ^i' ihen for < C < 1, 

Pt[\X - E[X]\>CE[X]]<2e'^-^. (5) 

The following submartingale concentration inequality will be used extensively in our proofs (referred to 
Theorems 2.38 and 2.41 in ICL06| 1. 

Lemma 2. Suppose that {Xq, • • • , Xt} is a sequence of random variables associated with a filter {J^q, ■ ■ ■ ,Tt} 
and Q is some event on the probability space. If for I < i <t, 

E[x,\T,^i,g]<x^^i, 

Var[X,\T,-i,g]<al 
X, -S[X,|J-,_i,a]<M, 

where (jf,M are non-negative constants. Then we have 



Pr[Xt > Xo + A] < e '^^'=1 -,^+^"/3 + Vv[-^g]. (6) 

The supermartingale concentration inequality is similar and we omit it here. 

In the following sections, we will use constants co, ci and C2 which may depend on ^ to characterize some 
bounds. We state here the conditions that the three constants should satisfy. 

(co - ci - 1)(1 - l/(e + 2)) < ci < 2(co - ci - 1)(1 - 2/(2 + S)) (7) 

ln(g(l+e/2) + l) 
^' ''Mn((7 + 400/O2(m+C/2) + l)) ■ ^ ^ 

Note that for fixed ^ we can always choose co to be large enough to guarantee that C2 is also large, which 
will ensure that the bounds we obtain in the proof are good. 

In the definition of our base model, a new vertex will create 2m self-loops if there is no existing vertex 
within distance at most r from it. This rule is made to guarantee that at each step the degree of the 



graph grows by 2m, which facihtates further analysis. Moreover, in most interesting cases when r = tq ~ 
ri~^/^(ln?i)'^", if t grows as large as t = Q( |-inni"co-i ), then with high probability for any vertex that conies 
after time r, there will be many existing nodes within distance at most r from it. Therefore we will focus 
on the processes that all the later comers will choose existing nodes as neighbors other than creating 2m 
self-loops. 

In analyzing the degree sequence of our base model, it is convenient to compare the chosen probability 
given in Eq. © with the traditional case (eg. |BolQ3| ). in which at each step t + 1 an existing vertex v with 
degree k is chosen with probability ^, where 2i is the total degree of all existing vertices. Thus it is natural 
to consider of using a good estimation of ^ for further analysis. In particular, we would like to have some 
good bound on the normalized quantity of the denominator of © . Let Tt (u) denote this quantity, namely, 
Tt{u) = J2veB^iu)nVt('^^Stiv) + S). A closely related quantity is Zt{u) = J2veBr{u)nVt ^' which is the number 
of vertices in Br{u) at time t. We have several simple facts on these two quantities. 

Lemma 3. If u €z S and t > 0, then the expectation ofTt{u) is Ar(2m + 6)t. 

Proof. Note that 

E[Tt{u)] ^ E[ Y. {degt{v)+5)]=E[Y,{degt{v)+d)l,eBAu)] 
veBriu)nVt veVt 

= E[Y degt{v)l^eBAu)] + S^rt . (9) 

veVt 

The first part of ^ is 2Armt as given in Lemma 1 and 2 in |FFV07a] . which completes the proof. D 

Let Ar denote the area oi Br{v). Then Ar = area(i?r(w)) ^ ?'^/4, for r — o(l). Let tr = — 2"i" c"co) — ^^^ 
thus A^ir ~ 3(lnn)2(nr2)'=i/=o. We will consider that r > tq = n-^/^(\nny° andletto := Uo = (inn)ii^4"^^i-^ ■ 
We first give an estimation of the quantity Zt{u). 

Lemma 4. If r > rg, then for any t > tr, with probability at least 1 — 2r7,^'"", we have that 

\Ztiu)~Art\<j-^^At. 

Proof. Noticing that Zt{u) — J2i=i ^xteBr-iu) and that Fill^.^Briu) = 1] = ^r, we can obtain the result by 
simply applying the Chernoff bound. D 

From the above lemma, we can give a rough bound on Tt{u). 

Lemma 5. If r > vq, then for any t > tr, with probability at least 1 — 4n^'"", we have that 

(1 - (,,2)l/2co )(l + OrnArt < T,[u) < 4(1 + ^^^,^^^^,J i2 + O^A. (10) 

Proof. The left inequality is obvious by using the trivial relation that Tt(u) > m{l + ^)Zt and the bound on 
Zt given in Lemma SI 

To see the right inequality, we note that the sum of the degrees of vertices in Br{u) is equal to the sum 
of out-degrees of all vertices in Br{u), which is equal to mZt, plus the sum of the in-degrees of vertices in 
Br{u), which is at most the sum of out-degrees of all vertices in B2r{u). Therefore, Tt{u) < (m + 5)Zt + 
"^E^ev.ns.^c.)! < (2"^ + '5) E.„ev4nB.„(n) 1 < i2m + S)A2rt{l + j:;^^^^) = 4{2 + ^)mArtil + j^^^^^^j^), 
with probability 1 - 2n"'"". D 



4 Small Diameter 

It is obvious that the diameter in the base model is at least fl{l/r) = n{n^/^{\nn)^'^°) for all r > tq,, 
since any vertex can connect nodes that within distance at most r from it and the maximum distance of two 
vertices is n{l). However, with the addition of the ability to choose uniformly from the subset of previous 
vertices, the diameter can be reduced to O(lnn), with high probability. We will use the following classic 
result on the diameter and the maximum degree of a uniform recursive tree. 

Lemma 6. With high probability, the diameter and the maximum degree in a uniform recursive tree is 
e(lnn). 

Proof. This is a classic result, for which a proof is referred to such as |Pit94[|DL95j . D 

Now the diameter of the two generalized models can be bounded as follows. 

Proof of Theorem [II We consider the two models separately. 

1. In the hybrid model, no matter how the local graph grows, the global graph is the same as the uniform 
recursive tree, which gives an upper bound O(lnn) on the diameter of the whole graph. 

2. For the self-loop model, the constructed tree in the flexible part are restricted to having degree at 
most S and thus may be different from a uniform recursive tree. However, by Lemma [6l the maximum 
degree of a uniform recursive tree is Linn, where L is the hidden constant in 0(lnn), from which we 
know that ii 6 > Lhin, then with high probability, the constructed tree in the flexible part is the same 
as the uniform recursive tree. Therefore, the diameter of the self-loop model is again upper bounded 
by O(lnn). Finally, we note that S = m£^ > Linn is equivalent to ni > Llnn/S,, which completes the 
proof. 

This completes the proof of Theorem [TJ D 

5 The Small- Community Phenomenon 

In this section, we consider the community structure and we will require that r — r^. We start from 
the intuition that a group of people close to each other form a good community, which can be thought 
of geographical communities QKBlOj ). In particular, for a node v, we define the i?-neighborhood Cn{v) 
of V to be the set of vertices within distance at most R from v in Gn, i.e., Cr{v) = Bb.{v) fl Vn- Let 
Rq — n~^'^(lnn)^'^'', we will show that Cr^{v) is a good community. In this section, we will assume that 
m > K2{^) Inn, where ^2(0 is some large constant depending on ^. 

Note that given v, the probability that a node generated uniformly at random from S will land in Bn^ [v) 

is Afl,, ^ ^o/4 = ""^ — . Using the Chernoff bound, it is easy to show that with high probability, the 
number of nodes in C^q is 0((lnn)*'^''), which means that the size of such i?- neighborhood is small. Now we 
consider the connectivity of the subgraph induced by CRg{v). 

Lemma 7. In the base model, if r = r^ = n^^' ^(Inn)"^", then for any v G Vn, the R^-neighborhood Ciig{v) 
induces a connected subgraph in Gn with high probability. 

Proof. We will first show that for every v, 6*^/2 (w) induces a connected subgraph in Gn with high probability. 
The lemma then follows from the fact that any two vertices u, u' in Cij^ (w) can be connected by a set of 
paths between vertices u — vi, V2, ■ ■ ■ ,Vk — u' such that each vertex pair (w^, ti^+i) is within distance r /2. 

Now we consider the connectivity of Cr/2(f). 

Let ArT = 12 Inn, and thus T — /j^^ \^;i"„-i . Let Hq be the subgraph induced by nodes within distance 
at most r/2 from v at time T. Now let a;^^ , • • • , a;^^ be the nodes that land in B^/2{v) after time T and 
let Hg be the corresponding subgraph when vertex xt^ is added in Bj./2{v). Since every vertex Xj will land 
in Br/2{v) with probability A^/2, we know that with high probability, for t > T, the number of nodes 



in Bj./2{v) will be in the range [Ki^r/2^7 ^2^7/2^] for some constants ki,H2- In particular, we have that 
\H^\ < K^A^jiT = 3K2lnn and Ki^,./2is < \iis\ < K2A/2^s- 

Now let Xs be the number of connected components of Hg and let Yg be the number of connected 
components of Hg connected to Xt^^-^- Then we have 

Xs+i = Xs - K,, + 1, Xo < 3k2 Inn . 

We show that if s < 6K2ln7i, Xg decreases by at least 1 for every s > 1 with probability at least ^, 
from which we know that the probability that i?6K2 inn is not connected is bounded by 0{n^'^) and then the 
Lemma follows from the fact that each later coming vertex xt^^-^ such that s > 6^2 Inn will connect the Hs 
with probability at least 1 — 0(n~^°). 

Let £ denote the event that for any u G Vn and for each t > T, Tt{u) < 32(2 + ^)mAr/2t, then as in the 
proof Lemma[Sl the probability that £ holds is 1 — 0{n~'*') . Now Conditioned on £, for each 1 < s < 6k2 In n, 
since Xt^ is in B^/2{v), we have that \xt^ —u\<r for every vertex u € -ff^-i and thus Xt^ will connect u with 
probability at least 

m + S 1 



Tt^-i{xtJ 32Aj./2ts 
Therefore, the probability that xt^ will not connect any vertex in Bj,/2iv) is 

\Hs 



Pr[Ys = 0] < (1 - '/' r < n 
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where the last inequality follows from the fact that m > i^2(C) Inn. 

Now we consider the case that Hg has at least two connected components, namely, Xg > 2. The 
probability that Xt^^-^ will connect at most one component is that 

Pr[n - 1|X, > 2] < 2(1 - / r < 1/10 , 

where we used the fact that 32A^/2*s > 96 Inn and that m > K2{£,) Inn. 

Therefore, Xg decreases by at least 1 for every 1 < s < 6K2lnn with probability at least jt:, which 
completes our proof. D 

Now we show that the conductance of Cr,, (v) in each model is small. 

Lemma 8. In both the hybrid model and the self-loop model, with high probability, for any v Cz Vn, we have 
that 

'^(^-W)^o(^,-^) . (11) 

Proof. We first consider the hybrid model. For convenience, we abbreviate Ci?p(w) as C. Let e(C, C) denote 
the set of edges that connecting C and its complement. Let ei(C, C) and 62(6*, C) denote edges in e(C, C) 
that are local and long, respectively. Then we have: e(C, C) = ei(C, C) U 62 (C, C). 

Local edges connecting C and C must lie between the two spherical segments separated by the boundary of 
Cflo (v). More specifically, if e = (u, w) G ei(C, C), then one of w, w lies on the strip stri = Bf{g^r{v)\BRg (v) 
and the other point lies on the strip str2 = -Bjjp_r(w)\i?i?o(^)- With high probability, the total number of 
vertices in stri is at most n{2rRo + r'^) and the total number of vertices in str2 is at most n{2rRo — r^). Hence 
the number of local edges that lies between the two strips is at most AmnrRo, namely, |ei(C, C)\ < AmnrRo. 

Now we consider the long edges that connects C and the rest of the graph. We will show that the number 
of such edges is relatively small compared with the local edges therein. More precisely, we have the following 
lemma. 

10 



Lemma 9. In the hybrid model, let Yt denote the sum, of the long-degrees of vertices in BRg{v) ClVt- Then 
Yn < cAf/^n for some constant c, with high probability. 



Proof. By definition, we have the following recurrence for Yt 



t- 

E[r.,,|y.] = r. + A,„ + ^^^)^^. (12) 

Let Ab„T = 12 Inn, and thus T = ^^ — ^wrrrr- Let F denote the event that for all t > T, the relation 
\BRg{v) n Vil G [niAngt, K2Ajigt] holds for some constants ki and K2 and that the maximum long-degree of 
vertices xi,- ■ ■ ,xt is Linn. By Lemma |6] and the Chernoff bound, we know that Pr[J'] > 1 — 0(n~^). 

Now we know that for t > T, 

E[Yt+^\Yt,T] <Yt + An, + ^^^ , 

from which we have 

E[Yt+i\Yt,J^]-il + K2)AR,{t+l)<Yt-{l + K2)AR,t . (13) 

Conditioned on J-, we know that the number of vertices in BRg{v) n Vt is K2ARgT < 12k2 Inn and every 
vertex in this set has degree at most Linn, from which we know that Yt < 12K2L(lnn)^. Now define 



X.^ 



Yr - (l + K2)Ar,T i0VT>T+l , 



12K2L(lnn)2 for r = r . 

By inequality (|13p. Xt, ■ ■ ■ ,Xt forms a submartingale with error 0{n~^). We also have that for t > T, 

X,-E[Xr\Xr-l]<l , 



and 



Var[X^|X^_i] = Ysir[Yr\Xr-i] 

< {1 + K2)Ar, . 

Now we apply the submartingale concentration inequality as in Lemma [H we have that 
Pi[Xt >Xt + X\ < e ^(SUt+i'i+c2)^«o+V3) ^ 0(^-3) 



< g 2t(l + C2)A«„+2A/3 +0(^-3) 



Let A = c' ^InnARgt for some constant c'. Then 

PT[Xt >Xt + c'y/lnnARj] < 0{n-^) . 
Finally, using Xt — Yt — {I + K2)AR„t, we have 



PT[Yt > (1 + K2)AR„t + cVln uArJ + 12K2L(ln nf] < Oin-^^) . 
In particular, with high probability Yn < cAr^u for some constant c, which completes the proof. D 
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By Lemma 9, we know that |e2(C, C)\ < cAr^^ti. 
Thus, the total number of edges between C and C is 

|e(C, (7)1 = 0{mrRon + Rln) . (14) 

The volume of C is at least m\C\ ^ mRQU, which means that 

Finally, we briefly discuss the case in the self- loop model. Let 5 > ^i(C) Inn, then with high probability, 
the constructed tree in the flexible part of the model is a uniform recursive tree as the same as that in the 
proof of Theorem [TJ Therefore, the edges that connect an i?- neighborhood and its complement can be also 
bounded by the same argument as that in the case of the hybrid model, which then gives the same result 

as (HSl). n 



Now we can show that the two models have the small-community phenomenon. 

Proof of Theorem\^ For each v G Vn, the i?o-neighborhood Cb.o{v) is of size Q{{h\n)'^'^^'). By Lemmas 
7 and 8, we know that C/jg(v) is an (a, /3, 7)-community of w, where a is the hidden constant in term 

O f IP — ,\n/4^p j in Eq. ([T5|) . ji = l/4co, and 7 = 4co. This completes the proof of Theorem [2] D 

Note that the proof of Theorem [2] also implies that the base model Gt has the small-community phe- 
nomenon. In fact, in this case, we do not need to consider the effect of the edges generated in the uniform 
recursive tree, which simplifies the analysis. We can easily show that the i?o-iieighborhood Cfl,o(w) has 
small size, induces a connected subgraph and has conductance <^{Cb.o{v)) = 0( "^^^'^" ) = 0((lnn)~'^") = 

'^(ic — fernT*) — Jc — fa) 1 1/4 1 i-6-; every node in the base model is contained in a (a', 1/4, 4co)-community. 

6 The Power Law Degree Distribution 

In this section we prove Theorems [3] and HI In Subsection 16.11 we prove Theorem |3] by assuming a concen- 
tration inequality of the degree sequence, in Subsection 16.21 we develop an alternating concentration method 
to prove the concentration inequality desired, and in Subsection 16. 3i we prove Theorem |4l 

6.1 The degree sequence on the base model 

To prove Theorem[31 we analyze a recurrence on E[dfc(t)] as usual. Recall that Tt{u) — X^ues {u)r]V (^eSt(^)"l" 
5). As mentioned above, we will first give a good estimation of Tt(u) and show that Tt{u) concentrates around 
its expectation, building on which we can derive the degree sequence from the recurrence on E[dfc(i)]. 

Recall that t^ = — 2"i"c"co) — for any r > tq. We have the following concentration inequality of Tt{u): 
Lemma 10. (Alternating Concentration Theorem) If r > tq, then for all t > t^, we have that 

Pr[|r,(u) - (2 + i)mArt\ > ^-^^-j^mA^t] = 0{n'^) , (16) 

where Ci,C2 are some constants satisfying the conditions in Eq. O) and ^. 

Lemma [TU] is one of our key technical contributions in this paper which is interesting by its own. To prove 
it, we will need to develop an alternating concentration method, allowing us to alternatively and iteratively 
apply both the submartingale and supermartingale inequalities to prove a desired concentration result. The 
full proof of Lemma 10 is given in Subsection [ 



The role of Lemma [TU] is to give a good estimation of E 



\Gt 



Tt(xt+i) 

E[rffc(t)]. In this subsection, we prove Theorem [3] by assuming Lemma [TOl as follows 
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to analyze the recurrence of 



Proof of Theorem^ Define Dfc(i) := {v e V{Gt)\dega^{v) = k}. Then 4(0 = l^fe(OI 
The recurrence for the expectatfon of dk(t) can be written as fohows. 



E[dk{t + l)\Gt] 

= dkit)+ Y^ (mE 
veDk-i{t) 



(k- l+^)l|:c(+i_t,|<r 



Y^ I mE 



t.G_Dfc(t) 



^((Xt+i) 



Ttixt+i) 



\Gt 



Gt 



+ Oimnr,kiGt,xt+i)\Gt]) , 



(17) 



where r]kiGt,Xt+i) denotes the probabihty that a parallel edge from the new vertex xt+i to a vertex of degree 
no more than k is created, which is at most 



'^ \ X ^ X ^ /. ^N 2 / \v — xt+i\^7 






Ttixt+1 



Now for t > tr, let At denote the event that \Tt{u) — (2 + £^)mArt\ < -. — 2^:572^ "*^r-i- By Lemma [TUl we 
have 



Pr[A] ^l-0{n-^) 



Therefore for t > tr 



E 



E 

veDf^it) 



(fc + ^)l|xt+i-t)|<r 



Ttixt 



+1) 



E 



jk + S) 
{k + S) 



■\p (fc + (5)l|3.^^^_.„|<,r / 

^^^^ (2 + OmArt \ 



Oi: 



)]\At 



Pr[A] + 0(n-2) 



l + 0( 
l + 0( 



1 



1 



(2 + ^)mt V ^(^^2)c2/2co 



(nr^)'^2/2co • 

))E[4(t)|^]Pr[^]+0(n"') 

) ) (E[4(t)] - E[dfc(i)h^] Prh^]) + 0(n-2) 



(fc + 5)E[dfc(i)] 



o(- 



1 



(2 + C)TOi ^(„^2)C2/2co^ 



Similarly, we have 



E 



E 

t;G-Dfc-i(t) 



(fc- 1 + 5)l|xt+i-t)|<r 



T't(xt+i) 



jk - 1 + SMdk-ljt)] 

(2 + O^i 
The error term can be bounded as follows. 

E['qk{Gt,Xt+i)] 



Oi 



(nr^)'^2/2co 



) 



< ,;,E 



E E c^+^r 

i=m v£Di(t) 



2 I ^\v-xt+i\<r 
Tt{xt+l) 
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< (^lE 



E E (^• + ^)';;;iCT^ + Q( ^,.2W2co ) 



■i=m veDi{t) 



' (rir^)'^^/^'^'' ■ 



+ 0(n-2) 



, o,S±f ,,0,O 



If fc + ,5 < /co(t) - („r2)ci/2co-c,/4co^ thej, E[,7fe(Gt,a;*+i)] = O( (i„„).(„t^)c,/.co ) and E[m,7fe(Gt,Xt+i)] = 
O( („,.2)c2/2co ) given the fact that m = 0(ln^ n). 

Let dfc(i) := Fj[dk(t)]. Now the recurrence can be simphfied as 



dkit+l) = du{t) 



{k + 5)dk{t) , {k~l + 5)dk-i{t) 



(2 + 0^ 



(2 + 0^ 



+ lfc=m + 0(- 



1 



:) • 



We define a new recurrence related to (fT8| . For j < to, let /j = and for j > m, let 
which has solution that /,„ = r>+(+m+s ^"^^ ^°^' fc > to, + 1 , 

f - rr ^_i±i_f 

•^' - 11 2 + e + J+r'" 

r(fc + (5)r(TO + 4 + g + .5) 2 + g 

r(3 + e + A: + S)T{m +l + S)2 + C + m + S 

_ (^fc(TO, (?) 

where (?!)/c(to., (5) tends to a limit (jjooirn, 6) which depends on m, 6 only as fc — t- oo. 
Now we show that 

, - , , , / n + Lt 

\dkit)-ht\<M[tr+ ^^^,^^^j,^^ 

where M is some large constant and L is the hidden constant in term 0{-, — 2^7572^5") in Eq. P^ . 
We prove ll20|) by induction. 



(18) 



(19) 



(20) 



1. First note that for t < tr, the above relation holds trivially; for t > tr and k > ko{t), the inequality 
follows from the fact that dk{t) < 2mt/k. 

2. Now assume that t > tr and k < fco(i), we have 



|J(fc + i)-A.(t + i)| 

urn f(f^^^ (fc + '^K(t) (fc-i + J)4„i(t) 
Mfe(t) - Mt + 1) - (2 + 0^ "^ i^TWt ^ 



(jiy'i.\C-2 I 2cq 



)l 



\hit)-ht- ^ 2 + e ^-^ 2 + ^ 
(fc + (5)dfe(t) , (fc -1 + 5)4-1 (t) , 



(2 + Oi 



(2 + Oi 



0( 



(nr2)'=2/2co ' 
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< II- 

< M (t, 

< M (t, 



k + 5 

i^ + Ot 

n + Lt 



\Mt) - Ml + ^-^\d,-^{t) /._it| + o(--i— -) 



(2 + C)i 



' (nr^)'^2/2co ■ 



n + L{t + l) 



L 



(fl1-'2^C2/2co 



{nr^)c2/2co 
This completes the induction and the proof of Theorem |31 D 

6.2 Estimation of Tt{u) - Alternating Concentration Analysis 

In this subsection, we prove the Alternating Concentration Theorem, that is. Lemma [TOl 

As mentioned above, Flaxman et al. |FFV07ai IFFVOTb) and van den Esker |vdE08| introduced a new 
parameter a > 2 to facilitate the analysis and they used the traditional coupling technique to bound Tt(u). 
In our settings, we do not use the additional parameter a and we can still get a nice bound. Our idea is 
to develop a refined method based on the recurrence directly implied in the definition of Tt{u). By using 
this recurrence, we can start from the weak bound as given in Lemma [S] and iteratively improve both the 
upper bound and the lower bound of Tt{u). This improvement can be done by using the submartingale and 
supermartingale concentration inequalities as in the proof of Lemma [S] This allows us to show that the 
accumulated error in the whole process is small and therefore guarantees the desired bound. 
At first, we show that a lower bound can be achieved from a rough lower bound on Tt{u). 

Lemma 11. Fix r > rQ. If for any t > tr, 



Pv[Ttiu) < ibi - 



(ll'p2\ci/2co 

for some hi G [1/2, 1) and ri ~ o{{nr'^Y'^''^'^"), then for any t > tr, 



){2 + 0mArt] <ei , 



Pi-[Tt{u) > {bu + 



-){2 + £,)mArt] <e„ , 



— — ^ j^ G (l,oo), r^ = 7 + 40r;/^, £„ = ne; + 5n i""+i and Ci is some constant satisfying the 



(llf2\ci/2co 

where 6„ = ^ 

condition given in Eq. (f^. 

Proof of Lemma \ll\ We will mainly use the following recurrence. 

E[Tt+,{u)\Gt] = T,(u)+m(l+OE[l|,,^,_„l<,|Gt 



(21) 



(22) 



veVt 



where 



Pr[y*+^ = v\Gt] = E 



(degt(w) + (5)l|^^^^_„l<^ 



Tt{xt^ 



Gt 



Let Q denote the event that for all t > tr, the following inequalities hold: Tt{u) > {hi — -. — 2^fjy2^)(2 + 
S,)mArt and (1 - j;^^^j^^j2^){l + O^Art < Tt{u) < 4(2 + OmA,(l + ^-^^l^^^). Then by Lemma[5]and the 
bound given in (f?!]) . Pr[-iC/] < nei + 4ri^'""+^. Conditioned on Q, for t > tr, we have 



Pv[yl+'=v\Gt,g] < E 



(degtiv) + S)l^^BAxt+i) 



ibi 



degt{v)+S 



^i^j^){2 + 0mArt 



\Gt,g 



(bi 



(nr^) 



k72^){^ + Omt 



< 



degt(i') 



r(l 



Ari 



hi{2 + S.)mt^ [nr'^yi/2co 



) 
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Therefore, 

1 4r; 

where the second inequahty uses the rough upper bound on Tt(u) in Lemma [5l 
Let bu = ^^t^ i , and s = 40r;/^, then 

E[T,+i(u)|Gt, g] - {bu + (^^2)l/2co )(^ + OmAAt + 1) 

< (i + ^;(^)(^*(-)-(^"+(;^^^)(2 + a-^^*) • (23) 



Now define 



^^ ^ tor I > tr 



Xi — ■{ n3=t^(l+6,(2+£)j) 

Ttju) - {bu + („^.)^,/.. J (2 + OmArU iOTi=tr . 

From inequahty ^7^, we know that E[X^\Gi-i,Q] < Xi^i for U <i <t. Let Aj = ]!}=*, (1 + b^JTW' 
{J-y/f>i(2+i)_ We have that 

X^-E[X^\G.^„g] = T.i-)-mi-)\G.-ug] < (2 + ^)^ , 

and 

Var[T,(«)|G,_i,g] 



Var[X,|G,_i,a] 



< 






Af_i 



^'-i(") -I- r^ -I- 1 + 40ri \ A 

< (2 + 4)rn 



Ati 



- Af_, 



Therefore, the sequence Xt^, ■ ■ ■ ,Xt satisfies the conditions in Lemma [5] with Pr[-iC/] < ne + An i""+i 
and 



Y, Var[X,|G.-i,g] 





i=t^+l 


< 


^ {i + ZYm^A, 

^ A2 , 


^ 


* (e + 3)2m2A.t?/^'(2+«) 






^ 


/ TnA^^t^ ,2 




( Inn ^ ■ 



(24) 
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The last inequality can be seen by using the fact that Artr '^ 3(lnri)^(nr^)'^i/'^'', the assumption that 
(co - ci - 1)(1 - l/(e + 2)) < ci. Specifically, 

f. ii2/bi{^ + 2) = f, then 

Z^ i2/h{2+0 - ^ ,r\irii M^t J In In n ^ " ^ Inn ^ 

2. if 2/fe,(C + 2) > 1, then 

3. if 2/6KC + 2) < 1, then 

* (-<; I o\2™2/l .2/6,(2+5) J. ™2/12j-2 

^L.^^ ,2/b,(2+«) ^ ^^"^ ^rt{ ^ j j - ^l^^^^(^)i_2A(C+2) ^ 



3(lnn)^+^^^"W5j)(nr2)'^/""+^"^/"°"^^'^"WF^) 

< ^ ■ < ( ) 

3(lnn)^+^^^"WF5))(^r2)ci/co+(ci/co-i)(i-^j|2j) - Inn 

If we let A = 2m Artr, then using the submartingale concentration inequality, we have 

PT[Xt > Xt^ + A] 



< ne, + 5n-'""+i . 

On the other hand, we have that Xt^ < hmArtr conditioned on Q. Thus, Af_i(Xi^+A) < 7{^Y/^'^'^+^'^mArtr 
^^^^1-1/6,(2+5)^^^^^ < 7 — 2-|Ii/2co rnArt, where the last inequality follows from the assumption that (2co — 
2ci - 2)(1 - 2/(2 + 0) > ci. Therefore, 

Pr[T,(u)>(6„ + ^-i±^)(2 + OmA.i] 

^ p^^^ ^*(") - (^" + („.^)^./^-o )(2 + O^Art ^ ^^^ ^ ^^ 
~ At_i ~ 

< ¥v[Xt>Xt^+\] 

< ne/ + 5n-'""+i . 

The proof completes by letting r„ = 7 + 40n/^ and e„ == ne; + 5n^ inn+i^ q 

Similarly, from a rough upper bound, we can obtain an upper bound on Tt{u). 
Lemma 12. Fix r > rg. If for any t > tr, 

PriTtiu) > (6„ + ^-I^){2 + OmArt] < e„, (25) 
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for some b^ £ (1,4) and r„ — o{{nr'^Y'^''^'^°), then for any t > to, 

Pr[T,{u) < [hi - ^^J^l/.J i^ + OmAA] < Q, (26) 

where bi = ^ _ i G (1/2, 1), r; = 7 + 40r„/^, ei = ne„ + 5n^ inn+i^ ^^^^ ^^ ig some constant satisfying the 
condition given in Eq. Q). 

Proof. The proof here is similar to proof of Lemma [TTJ Note that we should instead use the supermartingale 
concentration inequality and let Q' denote the good event defined similar to Q in the above proof, which will 
lead to the following recurrence. 

E[r,+i(u)|G„(7'] > Uu) + mil + OAr + ^;j^^{^ - ^Jy^i,^^ mu) 

Let bi — „J^_ i , and s' — 40r„/^, then 

E[Tt+,{u)\Gt,g'] - {bi - ^-^-j^){2 + 0mAr{t + l) 

+ (^+e + l-&.(2 + + (--^'-5r„ + .'(2 + 0)^^^^3^)™A. 



We then define the corresponding supermartingale X'^^ ,■■■ ,X[ using the above inequality. In this case, 
we will also use the conditions ([7]) on the constants co and ci. Then by setting A' = ^mArt, where A' 
corresponds to the parameter A in Lemma [TTl and using X[^ > ^niArtr, we will get 



Prmu) < ik - ^J^+J^J {2 + OmAt] 

< Pt[X', < Xi ^ X'] 

and complete the proof by letting r; = 7 + 40r„/^ and e; — ne^ + 5n~'""+'^. D 

Now we are ready to prove Lemma 1101 Intuitively, we will iteratively apply the above two lemmas and 
show that if we start with a rough lower bound li, then by Lemma [TTl we can get an upper bound u, from 
which we can again get a new lower bound I2 by Lemma Il2l We prove that I2 > h, which means that we 
get a better lower bound in every iteration. The same holds for the upper bound. 



Proof of Lemma \l(A If — ^2+; > 4, then we start our iterative process from the rough upper bound in 

2+5- j^ 

Lemma [5l Otherwise, we start the process from the rough lower bound. 

Assume we start from the rough lower bound, and the case of starting from the rough upper bound is 
similar. By LemmaEl we know that for all t > tr, Tt{u) > (1 — -, — 2-)ci/2co )(1 + C)"^^r^ with probability at 
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least 1 — 4n '"". We define the start point of our iterative process by letting h\ — |^ S (1/2, 1), r] ' — 1, 

For i > 1, assume that we have that Tt{u) > {bi' — -. — ^^75^) (2 + (,)mArt with error probability ei' 
for any t > tr- Now we substitute the corresponding parameters in Lemma [Til to give an upper bound that 
Tt(u) < ib^u' + , S,/2a,, ) (2 + OmArt for all t > U with error probability el'^ , where bi'^ = ,_^^+^i € (1,4], 

Again we substitute the corresponding parameters in Lemma [T^] to give an improved lower bound that 
Tt{u) > (6f+'^ - („l'r/2co )(2 + OmArt for all t > U with error e|'+'\ where &[*+'' = iTi^ ^ (1/2, 1), 

Let C(0 = 7 + 40/C. Then r;''+^' = C{£,frf^ and ef+^^ < n^ef^ + 10n-i""+2. 

Now we show that for every i, 6; is strictly greater than 6; , i.e., the process gives better lower bound 
after every two consecutive steps. Then by the fact that b^ < 1, we have that {&; }i>i converges to 1. 
Similarly, it can be shown that the procedure gives better upper bound; namely, {bu }i>i is a decreasing 
sequence which converges to 1. In the following, we actually prove a stronger result that after each iteration, 
the distance between 6; and 1 decreases by a multiple factor, which guarantees that the {&; }i>i converges 
quickly to 1. 

We calculate the distance between b^ and 1, which gives that 



2 + ^-^ 



1+^ 



'^'-^ 



2 + g rk 

e(2 + 0&|'^ + l 
1 „ ^(0 



< 



C(l+^/2) + l- 

Therefore, the sequence {1 — 6; }i>i decreases by a multiple factor at least £q_i_//21+i ^^ each step. On 
the other hand, since Tt{u) > [1 — (1 — 6j ) — (i,^'„\ci ](2 + S,)7nArt, the best bound is determined by the 

r^^^ (i) . . 1/2 

maximum of ,, ' ,„, and 1 — 6, , which is at most ,f,, , /,„-, , ,s; . We terminate the iteration at the step 

(lnn)'=i / ' (4(l+4/2) + l)' ^ 

kn - r (ci/2co)ln(«r^) n , In „ • , • , 1/2 < '■j""' _ g(0"°° „„J 

'^0 - I in(c(£)2(J(l+J/2) + l)) 1 ^ 4 ' 1^ Wmcn case /.^ , . ,2'l + l~l''0 - fnr^-l-l/^-^o - |-«r2)<=l/2co ' ^'^'^ 



1 

(nr^)^ 



Prm(u)<(l- ^^^^^^J (2 + e)mA.t] 



where we used the assumption that C2 = Ci in('(g^£)2(£n4,£/2)j-i)) • 

The upper bound can be obtained similarly by noting that the sequence {&« — l}i>i decreases by a 
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multiple factor at least fco+fi+i — ffi+//2^+i ^^ each step. Hence, we have that 



Pr[\Tt{u) - {2 + £,)mArt\ > 



(nr2)'=2/2co 



I Art] < 



(28) 
D 



6.3 Power Law Distribution of the Generalized Models 

In this subsection, we prove Theorem UJ based on both the result and the proof of Theorem [31 

Proof of Theorem Rl Since the local-degree sequences in the hybrid model is exactly the same as the degree 
sequences in the base model, by Theorem |31 the local graph of G^ has the power law degree distribution. 

Now for the self-loop model, in which the degree of a node v can be expressed as deg((w) -I- 6, where 
degf(u) is the number of non- flexible edges incident to v at time t. Now we can write the recurrence as 
follows. 



E[dk+s{t + l)\Gt] 
dk+s{t)+ Yl '"^^ 



(fc- l+(5)l|:rt+i-i,|<r 



E ( 



toE 



Tt{xt+i) 



Gt 



v<£Dk+s(t) 



Tt{xt 



\Gt 



+1) 



0{mE[ijk{GuXt+i)\Gt]), 






(29) 



, where 



Solving the recurrence, we can also arrive at P^ . which gives the solution of the form 

(j)'i^{m,S) tends to a limit (f>'^{m,S) which depends only on to, J as A: — >■ oo. This finishes the proof that the 

degree sequence of the self-loop model follows a power law distribution. D 



7 Large Community and Small Expander 

In this section, we will prove Theorem 5. 

Before proving the result, we give a brief discussion on the choice of r. In the previous sections, we 

{\nny° for some sufficiently large constant cq. The base model as well 



considered the case when r 



-1/2 



as the two generalized models has the small-community phenomenon and the power law degree distribution. 
Now we consider other choices of r and show that if r is too small or too large, then there is a strong 
evidence indicating that the model does not have the power law degree distribution or the small-community 
phenomenon, respectively. 

When r is as small as r = n^^"^*^, for any e > 0, then every node connects only a very small fraction 
of neighbors and the whole graph is almost surely disconnected f jPen03| ). Furthermore, there are many 
isolated vertices in the base model in this range of r, which indicates that the base model is very unlikely to 
have the power law degree distribution. 

When r is as large as r = n~^/^+% for any e > 0, we have shown that the models have the power law 
degree distribution. However, the small- community phenomenon does not seem to exist in this situation. 
In particular, there exists an interesting division of the structure of the i?-neighborhood when R varies. 
Specifically, we have shown in [LP 11) that under this range of r, if i? = n^^^^^^ for any p > e, then with 
high probability, for any v, Cr{v) is an (a, /3)-community for some constants a,/3 of size 0(n^''), which 
indicates that every node belongs to some large community. Here we show that with high probability, for 
all R = o{r), and for any v G Vn, the conductance $(C_r(w)) of Gr{v) is larger than some constant, which 
indicates that the i?-neighborhood is not a good community. 

Now we give the proof of Theorem \5\ 
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Proof of Theorem\^ The first part of tlie theorem is given in |LPllj . Here we prove the second part. 

For some fixed R = o{r), we let C = Cfl(f) and C — Cr-R{v) for convenience. Then for any vertex 
u & C and u' G C", the distance between u and u' is at most r. The areas of Br{v) and Br-R{v) are 

&Yea{BR{v)) - R^/i 
area(B,_fl(v)) ~ (r - i?)V4 - rV4, 

respectively, which means that a uniformly generated point will land in Br{v) and Br-R,{v) with probability 
i?^/4 and r^/4, respectively. 

We will show that there are many edges between C'\C and C. To be more specific, let Ci (or C{) be 
the vertices in C (or C") that were born before or at time n/2 and C2 (or C2) be the set of vertices in C (or 
C") that were born after time n/2. We show that the sum of the number of edges e(Ci, C2) between Ci and 
C2, and the number of edges e(C2, C[) between C2 and C[ are large. 

Let f denote the event that for any u e 14, and for each t > to, Tt{u) < 8(2 + ^)mArt, then by Lemma[5l 
the probability that £ holds is 1 — 0(n^ '""). Now Conditioned on £, for any vertex Xj e C2, the probability 

that the i-th contact of x,- lies in Ci is at least ^" , i' > .,„ , A . > 1 / ■ Thus, \e(C-\,Cn)\ dominates 

\r^ I 
Si (to I C2 1, g. ), where Bi{N,p) denotes the binomial distribution with parameters N and p. 

Similarly, for any vertex Xj € C'2 , the probability that the i-th contact of Xj lies in C( is thus at least 
^"+^)l^il > i'+|\lf I > J^. Thus, \e{C2,C[)\ dominates Bt{m\C2\,^). 
Totally, the expected number of edges between the C and C'\C is 

E[|e(C, c \c)\] > -^j^ + 8^^„ ' 

which is at least ?7i|C|/16 conditioned on the event A that C[ and C2 are both of size at least Arfi/i. There- 
fore, by HoefFdings inequality and the fact that Pr[-i^] = 0(n~^), we see that \e{C,C)\ > \e{C,C'\C)\ > 
m|C|/32 with probability at least 1 - e-"l<^l/32. 

On the other hand, \C\ — o{Arn) with high probability. Therefore, 

Pr[3i? = o(r), 3«, |e(Cfl,(«),C,,,(«))l <™|Cfl,(«)l/32] < ^ T e-™'^/^^ ^ o(l). 



fe=i 



where the last inequality follows from the assumption that m > Klnn, for some large constant K. 
Finally we note that vol(Cii;(w)) < m|Cfl,(w)| + |e(Cfl(w), (7fl(?;))| and then we have 

m|6fl(w)| +TO|6fl(i;)|/32 

with high probability. This proves Theorem [5l D 

Finally, we remark that the above proof can be adapted to the two generalized models G^ and Cfj . Since 
the number of long edges is relatively small compared with the number of long edges, the effect of long 
edges do not change the community structure too much. Specifically, to show that for R = o{r), Cr{v) is 
an expander in G^ and G|, we just need to use that yo\{Cr{v)) < (m + l)|Ci?(w)| + \e{CR{v),CR{v))\, and 
\£{Cr{v),Cr{v))\ > m\C\/32, which follows exactly the same as above. 

8 Conclusion 

We investigate the small-community phenomenon in networks and give two models that unify the three typical 
properties of large-scale networks: the power law degree distribution, the small-community phenomenon and 
the small diameter property. The proposed network models provide us insights of how real networks evolve 
and may have potential applications in, e.g., wireless ad- hoc model and sensor networks. 
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We have shown that the choice of parameters is subtle if one wants all the three properties to coexist. 
The fundamental conflicts is discussed, i.e., the power law degree distribution generated by the preferential 
attachment scheme and the small diameter always lead to an expander like graph, while the small-community 
phenomenon corresponds naturally to anti-expander in some sense, which means that the conductance of 
many subsets of small size is of order o(l). Other reasons for such conflicts worth further investigation. 

Finally, our proof technique for the power law degree distribution is of its own interest and it partially 
solves the open problems in |FFV07a] et al. It is interesting to find other applications of this method, in 
particular, in the analysis of randomized algorithms and network modeling. 
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